LLM Benchmarks (China)

SuperCLUE, which builds on CLUE (the Chinese Language Understanding Evaluation benchmark), is a Chinese-developed benchmark originally launched in 2019 and updated since.

Jeff Ding translates

Key Takeaways: There is still a significant gap between GPT-4-Turbo (OpenAI’s best model) and LLMs from China’s top tech giants and start-ups, even for prompts and outputs in Chinese.

SuperCLUE benchmarks ranked